Classification of Genes for Disease Identification Using Data Mining Techniques
نویسنده
چکیده
Scientists are nowadays providing great awareness about microarray gene expression dataset. Researches tell that data mining fails to recognize the most important biological associations between genes. Recently biological information mining using clustering techniques were used for the analytical evaluation of gene expression. There are many challenges exist in the existing methods. Optimization Algorithm for multi dimensional search space does not provide relational optimization result on varying gene expressional problems. The existing method solves the clustering problem, but bi-cluster based gene expression information was not extracted. A key point on existing work was to handle multi modal structure optimization problems with effective searching process, but it does not offer relational sequence optimized result on the associated gene data. To overcome the above issues, the proposed research is developed. The objective of the proposed research is to extract the biological information and identify the relational sequences on gene expression to identify abnormal genes. The proposed techniques like biological process on physiological data, PCPHC and Biclustered Ant Optimized Feature Relational Sequencing extract the biological process information from gene expression datasets. These techniques are tested on various bench marked datasets called Cancer Gene Expression datasets Broad Institute repository for experimental evaluation of the proposed method with an existing method, which identifies and extracts the hidden information from datasets. Finally, the gene patterns were verified as normal or abnormal on the basis of simple pattern matching process.
منابع مشابه
Identification of Fraud in Banking Data and Financial Institutions Using Classification Algorithms
In recent years, due to the expansion of financial institutions,as well as the popularity of the World Wide Weband e-commerce, a significant increase in the volume offinancial transactions observed. In addition to the increasein turnover, a huge increase in the number of fraud by user’sabnormality is resulting in billions of dollars in lossesover the world. T...
متن کاملIdentification of Fraud in Banking Data and Financial Institutions Using Classification Algorithms
In recent years, due to the expansion of financial institutions,as well as the popularity of the World Wide Weband e-commerce, a significant increase in the volume offinancial transactions observed. In addition to the increasein turnover, a huge increase in the number of fraud by user’sabnormality is resulting in billions of dollars in lossesover the world. T...
متن کاملUsing Combined Descriptive and Predictive Methods of Data Mining for Coronary Artery Disease Prediction: a Case Study Approach
Heart disease is one of the major causes of morbidity in the world. Currently, large proportions of healthcare data are not processed properly, thus, failing to be effectively used for decision making purposes. The risk of heart disease may be predicted via investigation of heart disease risk factors coupled with data mining knowledge. This paper presents a model developed using combined descri...
متن کاملDetecting Diseases in Medical Prescriptions Using Data Mining Tools and Combining Techniques
Data about the prevalence of communicable and non-communicable diseases, as one of the most important categories of epidemiological data, is used for interpreting health status of communities. This study aims to calculate the prevalence of outpatient diseases through the characterization of outpatient prescriptions. The data used in this study is collected from 1412 prescriptions for various ty...
متن کاملIdentification of Alzheimer disease-relevant genes using a novel hybrid method
Identifying genes underlying complex diseases/traits that generally involve multiple etiological mechanisms and contributing genes is difficult. Although microarray technology has enabled researchers to investigate gene expression changes, but identifying pathobiologically relevant genes remains a challenge. To address this challenge, we apply a new method for selecting the disease-relevant gen...
متن کاملDetecting Diseases in Medical Prescriptions Using Data Mining Tools and Combining Techniques
Data about the prevalence of communicable and non-communicable diseases, as one of the most important categories of epidemiological data, is used for interpreting health status of communities. This study aims to calculate the prevalence of outpatient diseases through the characterization of outpatient prescriptions. The data used in this study is collected from 1412 prescriptions for various ty...
متن کامل